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Abstract. We characterize the points that satisfy Birkhoff's ergodic 
theorem under certain computability conditions in terms of algorithmic 
randomness. First, we use the method of cutting and stacking to show 
that if an element x of the Cantor space is not Martin-Lof random, there 
is a computable measure-preserving transformation and a computable 
set that witness that x is not typical with respect to the ergodic theorem, 
which gives us the converse of a theorem by V'yugin. We further show 
that if x is weakly 2-random, then it satisfies the ergodic theorem for 
all computable measure-preserving transformations and all lower semi- 
computable functions. 



1. Introduction 

Random points are typical with respect to measure in that they have no 
measure-theoretically rare properties of a certain kind, while ergodic theo- 
rems describe regular measure-theoretic behavior. There has been a great 
deal of interest in the connection between these two kinds of regularity 
recently. We begin by defining the basic concepts in each field and then 
describe the ways in which they are related. Then we present our results 
on the relationship between algorithmic randomness and the satisfaction of 
Birkhoff's ergodic theorem for computable measure-preserving transforma- 
tions with respect to computable (and then lower semi-computable) func- 
tions. Those more familiar with ergodic theory than computability theory 
might find it useful to first read Section [6J a brief discussion of the notion 
of algorithmic randomness in the context of ergodic theory. 

1.1. Algorithmic randomness in computable probability spaces. For 

a general reference on algorithmic randomness, see [7], |g, |2Cj] . We will confine 
our attention to the Cantor space 2 W with the Lebesgue measure A. In light 
of Hoyrup and Rojas' theorem that any computable probability space is iso- 
morphic to the Cantor space in both the computable and measure-theoretic 
senses [l3 j. there is no loss of generality in restricting to this case. 



We present Martin-Lof 's original definition of randomness 191 ]. 

Definition 1.1. An effectively c.e. sequence (V^) of subsets of 2 <UJ is a 
Martin-Lof test if A([Vi]) < 2~ l for every i. If x G 2", we say that x is 
Martin-Lof random if for every Martin-Lof test (Vi), x f\[Vi]. 
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It is easy to see that A(f\[Vi]) = for any Martin-L6f test, and since there 
are only countably many Martin-L6f tests, almost every point is Martin-L6f 
random. 

In Section [H we will also consider weakly 2-random elements of the Cantor 
space. Weak 2-randomness is a strictly stronger notion than Martin-L6f 
randomness and is part of the hierarchy introduced by Kurtz in 

Definition 1.2. An effectively c.e. sequence (V^) of subsets of 2 <UJ is a 
generalized Martin-Ldf test if lim ri _ ) . 00 A([V^]) =0. If x € 2 W , we say that x 
is weakly 2-random if for every generalized Martin-L6f test (Vi), x Hi[Vi]. 

1.2. Ergodic theory. Now we discuss ergodic theory in the general context 
of an arbitrary probability space before transferring it to the context of a 
computable probability space. The following definitions can be found in 



Definition 1.3. Suppose (X, fj,) is a probability space, and let T : X — > X 
be a measurable transformation. 

(1) T is measure preserving if for all measurable A C X, /i(T _1 (A)) = 



(2) A measurable set A C X is invariant under T if T~ l (A) = A modulo 
a set of measure 0. 

(3) T is ergodic if it is measure preserving and every T-invariant mea- 
surable subset of X has measure or measure 1. 

One of the most fundamental theorems in ergodic theory is Birkhoff's 
Ergodic Theorem: 

Birkhoff's Ergodic Theorem. [5] Suppose that (X,n) is a probability 
space and T : X — > X is measure preserving. Then for any f € L\{X) and 
almost every x G X, 



converges. Furthermore, if T is ergodic then for almost every x this limit is 



If we restrict ourselves to a countable collection of functions, this theo- 
rem gives a natural notion of randomness — a point is random if it satisfies 
the conclusion of the ergodic theorem for all functions in that collection. 
In a computable measure space, we can take the collection of sets defined 
by a computability-theoretic property and attempt to classify this notion 
of randomness in terms of algorithmic randomness. In particular, we are 
interested in the following property: 

Definition 1.4. Let (X,fi) be a computable probability space, and let T : 
X — > X be a measure-preserving transformation. Let J 7 be a collection of 
functions in L\{X). A point x € X is a weak Birkhoff point for T with 



M (A). 




equal to f f dji. 
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respect to T if for every / € J 7 , 

lim I^/CT(z)) 



n— >oo n 



converges, x is a Birkhoff point if additionally 

lim -V /(?>))= / /d/x. 

j<71 

The definition of a Birkhoff point is only appropriate when T is ergodic; 
when T is nonergodic, the appropriate notion is that of a weak Birkhoff 
point. 

There are two natural dimensions to consider: the ergodic-theoretic be- 
havior of T and the algorithmic complexity of C. The case where T is ergodic 
has been largely settled. 

A point is Martin-L6f random if and only if the point is Birkhoff for all 
computable ergodic transformations with respect to lower semi-computable 
functions [H, Qjij]. The proof goes by way of a second theorem of ergodic 
theory: 

Poincare Recurrence Theorem ([HI, Chapter 26). Suppose that (X, fi) 
is a probability space and T : X — )■ X is measure preserving. Then for 
all E C X of positive measure and for almost all x € X, T n (x) G E for 
infinitely many n. 

In short, the Poincare Recurrence Theorem says that an ergodic trans- 
formation T returns almost every point to every set of positive measure 
repeatedly, and Birkhoff's Ergodic Theorem says that it will do so with a 
well-defined frequency in the limit. 

A point x 6 I is a Poincare point for T with respect to C if for e very 
E € C with positive measure, T n (x) € E for infinitely many n. In [171 ]. 
Kucera proved that a point in the Cantor space is Martin-L6f random if and 
only if it is a Poincare point for the shift operator with respect to effectively 
closed sets. Later, Bienvenu, Day, Mezhirov, and Shen generalized this result 
and showed that in any computable probability space, a point is Martin- 
Lof random if and only if it is a Poincare point for computable ergodic 
transformations with respect to effectively closed sets The proof that 
Martin-L6f random points are Poincare proceeds by showing that a point 
which is Poincare for any computable ergodic transformation with respect to 
effectively closed sets must also be a Birkhoff point for computable ergodic 
transformations with respect to lower semi-computable functions [H, 10]. 



Similarly, Gacs, Hoyrup, and Rojas have shown that a point is Schnorr 
random if and only if the point is Birkhoff for all computable ergodic trans- 
formations with respect to computable functions [L2|. (Recall that x is 
Schnorr random if x ^ Hj[Vi] for all Martin-L6f tests (Vi) where A([Vi]) = 2~ l 
for all i; Schnorr randomness is a strictly weaker notion than Martin-L6f ran- 
domness [22].) They also consider the case where there are strong mixing 
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Lower semi-computable 


Martin-L6f 
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assumptions on T in addition to being ergodic and show that the equivalence 
with Schnorr randomness still holds. 

In this paper, we consider the analogous situations when T is nonergodic. 
V'yiigin 24|] has shown that if x 6 2 aJ is Martin-L6f random then x is weakly 
Birkhoff for any (not necessarily ergodic) computable measure-preserving 
transformation T with respect to computable functions. Our main result 
is the converse: that if x is not Martin-L6f random then x is not weakly 
Birkhoff for some particular transformation T with respect to computable 
functions (in fact, with respect to computable sets). 

These results can be summarized in Table ll.2i 

This says that a point is weakly Birkhoff for the specified family of com- 
putable transformations with respect to the specified collection of functions 
if and only if it is random in the sense found in the corresponding cell of the 
table. 

We also begin an analysis of the remaining space in the table; we give an 
analog of V'yiigin's result, showing that if x is weakly 2-random then x is a 
weak Birkhoff point for all computable measure-preserving transformations 
with respect to lower semi-computable functions. 

The next two sections will be dedicated to a discussion of the techniques 
we will use in our construction. Section [2] contains a description of the 
type of partial transformations we will use to construct the transformation 
T mentioned above, and Section [3] discusses our methods for building new 
partial transformations that extend other such transformations. We combine 
the material from these two sections to prove our main theorem in Section HI 
while Section[5]contains a further extension of our work and some speculative 
material on a more relaxed form of upcrossings. Section is a general 
discussion of algorithmic randomness intended for ergodic theorists. 



2. Notation and Diagrams 

We will build computable transformations T : 2 aJ — > 2 W using computable 
functions T : 2 <LU -)■ 2 <UJ such that (1) a C r implies T(cr) C T(r) and 
(2) T(x) = lim 

n ^oo T(x \ n) is defined and infinite for all x £ 2 W outside a 
computable Q§ set with measure 0. 

We will approximate such a T by partial transformations: 
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Definition 2.1. A partial transformation is a computable function T : 
2<w 2 <u) such that if a C r and T(r) is defined then T(cr) is defined 
and T(o-) C T(r). We write T C T' if for all a, T(a) C T'(<t). 

We will be exclusively interested in partial transformations which are 
described finitely in a very specific way: 

Definition 2.2. A partial transformation T is proper if there are finite sets 
T_,T+ such that: 

• T_ U T + is prefix-free, 

• U CT6 T_UT + H = 2 W , 

• If there is a r C such that r G T_ then T(<r) = T(r), 

• If <T = r^p with t G T + then T(<r) = T(r)' -N p, 

• If G T_ then |T(er)| < |cr|, 

• If <7 G T + then \T(a)\ = \a\, 

• If a G T+ and cr ^ r G T+ U T_ then T(r) 2 T(<r). 

We say a is determined in T if for some such T_ , T + , some initial segment 
of a belongs to T_ U T+. 

The roles of T_ and T + will be clearer when we introduce a diagrammatic 
notion for describing transformations. For now, note that a proper trans- 
formation is defined by the finite sets T_ and T + together with the finitely 
many values of T on these sets. 

Throughout this paper, T is always assumed to be proper and measure 
preserving. 

We will use the method of cutting and stacking, which was introduced by 
Chacon to produce dynamical systems with specific combinatorial properties 
[1, 0(3- One tries to construct a dynamical system, usually on the real inter- 
val [0,1], by specifying the transformation in stages. At a given stage, the 
interval has been "cut" into a finite number of components, some of which 
have been "stacked" into "towers" or "ladders." A tower is read upwards, 
so the interval on the bottom level is mapped by the transformation to the 
level above, and from that level to the level above that. On the top level of 
a tower, the transformation is not yet defined. To produce the next stage, 
the towers are cut into smaller towers and further stacked. By manipulating 
the order in which the components are stacked, specific properties of the 
transformation can be enforced. This method has been extensively used in 
ergodic theory and probability theory to construct examples with specific 
properties (some overviews of the area are U, HI, 23]). 



A typical diagram is shown in Figure [TJ This figure represents that \o~\ < 
|°o| = = |°2| = | c"3 1 = 1 04 1 and that for all v, T(ai^v) = 0j+i""i; for 
i < 4, T(a^v) = 0, and similarly T(rj' -N f) = T(ri+\^v) for i < 2 while 
T{t2) = (}. Although it is not essential to interpret the diagrams, we will try 
to be consistent about the scale of blocks; in Figure [lj the relative width of 



1 Actually, according to [ll]], the method was first used several decades earlier by von 
Neumann and Kakutani, but not published until later fljl ]. 



6 



JOHANNA N.Y. FRANKLIN AND HENRY TOWSNER 



a 
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0"3 



0"2 



0"l 



GO 



TO 



Figure 1. A typical diagram 



the blocks suggests that |Tj| = |<7j| + l — that is, /u([rj]) = /x([<7j])/2; the height 
of a block does not represent anything, so we draw each blocks with the same 
height. The only relevant dimensions are the widths of the blocks and the 
numbers of blocks in the towers. In fact, proper partial transformations can 
always be represented by diagrams where all blocks have the same width, 
but it is useful to consider intermediate diagrams where blocks vary in width. 

In general, a block represents a subset of 2 W of the form [a] for some 
sequence a; by placing the block corresponding to [a] on top of the block 
corresponding to [r], we are indicating that r G T + and T(r) = a — that 
is, that in the transformation we construct extending T, T([r]) = [a]. (We 
must, therefore, have \a\ = \t\.) By placing some sequence a' with \a'\ < \a\ 
on top of the block corresponding to [<r], we are indicating that a € T_ 
and T(a) = a' — that is, in the transformation we construct extending T, 
T([a])C[a'}. 

The roles of T_ and T + in the specification of a proper transformation 
are now clearer: the elements of T__ U T + are the particular blocks labeled in 
a given diagram; the elements r G T + are those blocks which have another 
block on top, and therefore T is completely defined on every element of [r]. 
The elements r G T_ are topmost blocks of some tower, for which we have 
(at most) partial information about T on [t\. 

We will only be concerned with partial transformations satisfying two 
additional properties. 

Definition 2.3. An open loop in a partial transformation T is a sequence 
o"o, . . . , a n such that: 

• ko| = |fl| = • • • = \°~n\, 

• T(<Tj) = (jj+i for i < n, 

• T{a n ) C a . 

The width of a loop is the value 2"l°" l l, and the volume of a loop is n2~l CTl L 
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We say T is partitioned into open loops if for every a there is an open 
loop do, . . . ,o~ n in T with = 0% for some i. (In a proper transformation 
such a loop must be unique.) In such a transformation we write Ct(o~) for 
the loop <7o, . . . ,a n such that for some i, a = Oi. We write ir{a) for this 
value of i. 

We say t is blocked if there is any ex such that T(a) □ r. Otherwise we 
say r is unblocked. 

(We are interested in open loops to preclude the possibility that T(a n ) = 
do, since we are not interested in — indeed, will not allow the existence of — 
"closed" loops.) Diagrammatically, the requirement that T be partitioned 
into open loops is represented by requiring that any sequence written above 
a tower of blocks is a subsequence of the sequence at the bottom of that 
tower. (For instance, in Figure [H we require that a C gq.) 

Definition 2.4. An escape sequence for <r in T is a sequence Co, ■ • • , o~ n such 
that: 

• (T = a, 

• \°~l\ = \°~2\ = . . . = |<T„|, 

• For all < i < n, <Tj+i □ T(o~i), 

• If cr.j+1 is blocked then <r.; + i = T(<Ti), 

• T(a n ) = (), 

• All Oi are determined. 

We say T is escapable if for every determined a with |T(cr)| < |er|, there 
is an escape sequence for a. 

An escape sequence is reduced if the following two conditions hold: (1) 
o~i 5 T(o~j) implies that either % < j + 1 or <Tj is blocked, and (2) if i < n, 
then T{ui) ^ (). 

Escapability preserves the option of extending T in such a way that we 
can eventually map [o~o\ to anything not already in the image of another 
sequence (although it may require many applications of T). 

Lemma 2.5. Every escape sequence for a contains a reduced subsequence 
for a. 

Proof. We proceed by induction on the length of the sequence. It suffices 
to show that if <to , . . . , o~ n is a nonreduced escape sequence then there is a 
proper subsequence which is also an escape sequence for a®. If for some 
i > j + 1, <Tj 3 T(o-j) with dj unblocked, then ctq, . . . , &j, o~i, . . . , a n is also 
an escape sequence. If for some i < n, T(oi) = () then <to, . . . , cr, is also an 
escape sequence. □ 

Lemma 2.6. If <tq, . . . , <r n is an escape sequence for <to in T then for every 
p, do, (Ti^/O, . . . , cr^p is also an escape sequence. 

Lemma 2.7. Suppose o~o,...,ak is an open loop in T consisting of de- 
termined elements such that for all v, ctq E T(v) implies ctq = T(v) and 
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To,...,T n is a reduced escape sequence for To. Then one of the following 
occurs: 

• r □ cr k and for j > 0, Tj £ Uj< fe [<7j], 

• There is a unique j > such that for all i < k, Tj + i □ Oi, 

• For all j, tj £ Uj< fe [<Ti]. 

Proof. First, suppose some Tj □ 0{. If j" = then since |T(ro)| < |to | , we 
must have i = k. If j ^ and i ^ then since o~i is blocked, we must have 
Tj-i □ 0-j_i. Since ro 2 fi-i) we can repeat this and conclude that j > i 
and Tj-i □ do. Furthermore, for each z' < fc, Tj_j + j' □ ay; we have already 
shown this for i' < i, and for i' > i it follows since T , (r ? ) □ Ci+i, and so on. 

So we have shown that if Tj □ (Tj for some i, j with j > then we have 
a complete copy of the loop in our escape sequence. We now show that 
if j < j' and Tj □ cr^ then we cannot have Tj> □ (Tj; this shows both the 
second half of the first case and the uniqueness in the second case. For 
suppose we had Tj □ a^, j' > j, and Ty □ (Tj. By the previous paragraph, 
we may assume i = k. But since is determined and |T(cjfc)| < a/,, we 
have T{Tj) = T(a k ) = T(jji). But then we either have T(r J /) = () or 
T(Tj) □ Tj + i; in either case the sequence is not reduced, contradicting our 
assumption. □ 

3. Pieces of the Construction 

In this section we describe certain modifications of partial transformations 
and show that they preserve certain essential properties. First, we lay out 
the technical properties of the transformations we need to construct. 

Definition 3.1. A partial transformation T is useful if: 

• T is proper, 

• T is partitioned into open loops, and 

• T is escapable. 

It will be helpful to keep track of the following quantity: 

Definition 3.2. For any a, the burden of a in T, bd,T(cr), is Si<fcA t ([ <T ]) = 
/c2~' CT ! where k is the length of Ct{o~). 

Lemma 3.3 (Thinning Loops). LetT be a useful partial transformation, let 
<To,...,(Jk be an open loop of determined elements such that if r G T_ U T + 
then T(r) o-q, and let e = 2~ n be smaller than the width of this loop. Then 
there is a useful T' 3T such that: 

• There is a loop ro,...,7V in T' of width e such that Uj<fc/[rj] = 
Uj<fc[erj], 

• IfrfjL Ui< k [ai] then T'{t) = T(r). 

Furthermore, the burden of a in T is the same as the burden of a in T' for 
every a. 
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Figure 2. Thinning Loops, Lemma 



Proof. Figure [2] illustrates this lemma. Formally, let the width of o~o, . . . , o~k 
be 2~ m with m < n. Define T' ^ T by: 

• If r = o~]f~ p where \v\ = n — m and v is not all l's then T'{t) = 

• Otherwise T'(r) = T(r). 

Note that T C T" implies that T'(crf~*p) = <Ji+\" p for i < k. 

Propriety and the fact that T' is partitioned into loops are trivial. To see 
escapability, consider some r determined such that \T'(t)\ < \t\ and fix a 
reduced escape sequence To, . . . , r r in T. If r G U[crj] then by Lemma I27T1 we 
have r □ cifc and no other element of the escape sequence belongs to U[o"j], 
and therefore To, . . . , r r is an escape sequence in T' as well. 

If r U[<jj] but there is a j > such that Tj+j □ Oi for j < k then 
by Lemma 12.61 we may assume that for j > 0, \tj\ > n. Then since both 
T(t 7 _i) C Tj and o"o E we must have T(rj_i) C do, and therefore for any 
p of suitable length, the sequence 



TO, ■■■ ,Tj-l, (70^(0, ... , 0}">, ... ,0-^(1, ... , 1)^/3, Tj+fc+1, 

is an escape sequence for r in T'. 



□ 



We also need a modified version of the above lemma where instead of 
wanting Uj<&/[Tj] = Uj<fc[trj] we want to have a small amount of the original 
loop left alone. 

Lemma 3.4. Let T be a useful partial transformation, let o~o, . . . ,o~k be an 
open loop of determined elements such that if t G T_ U T + then T(r) 7$ o~q, 
and let e = 2~ n be smaller than the width of this loop. Then there is a useful 
T' ~D T with the following properties: 

• There is a loop To,---,Ty in T' of width e such that (Uj<fc/[rj]J \ 
(Ui< fe [cTi]) = e • k, 

• Ifr<jL Ui< k '[Ti\ then T'(t) = T(t), 
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• 1/r ^ Uj<fc/ [Tj] and r has an escape sequence in T then r has an es- 
cape sequence in T' which does not contain any element o/Uj<fc'[rj]. 

Furthermore the burden of a in T is the same as the burden of a in T' for 
every a. 

Proof. We proceed exactly as above except that we leave the strip of (1, . . . , 1) 
untouched and leave T'(ak^{l, . . . , 1, 0)) = T(o-fc). □ 

Lemma 3.5 (Lockstep Escape). Let T be a useful partial transformation 
and let o"o,...,(T^ be a reduced escape sequence such that \o~o\ = \o~i\ + 1. 
Suppose that T_,T + witness that T is proper, and define 

T' + = T+ U {a } U {ar (0) | i > 0}. 

Setting T'(ao) = <ti^(0), and for i > 0, T' {af^ (0)) = cjj + i'~ N (0) fully 
specifies a partial transformation T' D T. Then T' is useful and if a 
Ui<fc[o"j] U L)j<k>[Tj], where TQ,...,Tp.i is the open loop containing gq, then 
the burden of a is unchanged. 

Proof. To see that T' is proper, we note that we have specified T' + and 
(implicitly) T'_ , and we need only check that if a G T' + and a ^ r € T' + U T'_ 
then T'(t) 2 T'(a). Clearly we need only check this for T(a) = cjj'^(O). 
Since the escape sequence was reduced, we cannot have Oi = Oj for i ^ j, 
so we can restrict our attention to the r such that T'(r) = T(r). If cij was 
not blocked in T then there is no such r, and if cij was blocked in T then 
already T(cjj„i'^(0)) = £7^(0), and the claim follows since T was proper. 

It is easy to see that T" remains partitioned into open loops. 

Finally we check that T' is escapable. Let r determined be given with 
|T'(t)| < \t\. Then the same was true in T, so r had an escape sequence 
To, ...,T r in T. We may assume |ti| > \a\\. There are a few potential 
obstacles we need to deal with. First, it could be that for some i, Tj □ a®. 
Letting Tj = o-Q^p, we must have To, ... , ctq^p, (0)^ p, . . . , ak^(0)^p is 
also an escape sequence for To. 

Otherwise, there could be some i and j > such that Tj □ o~j. If Tj = o~j 
then to, Ti^(l), . . . , T r ^(l) is also an escape sequence in T and remains one 
in T". If Tj = aj^(l)^p, the same escape sequence works in T". Otherwise, 
replacing each such Tj with aj^(l)^p gives a new escape sequence in T 
which remains one in X". □ 

The main building block of our construction will combine these two steps 
as illustrated in Figure [3j The next section is devoted to explaining this 
process. 

4. The Main Construction 

Our main tool for causing the Birkhoff ergodic theorem to fail at a point 
is the notion of an upcrossing. 
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Definition 4.1. Given a measurable, measure-preserving, invertible T : 
2 W — >■ 2 W , a point x € 2 W , a measurable /, and rationals a < f3, an upcrossing 
sequence for a, f3 is a sequence 

< Ul < fl < «2 < V2 < ■ ■ ■ < UN < Vpj 

such that for all % < N, 



u. 



£ f{T*x) < a, -L- £ /( T%) > /3. 



j=0 j=0 

t(x, f,a, (3) is the supremum of the lengths of upcrossing sequences for 
a, (3. 

By definition, Birkhoff 's ergodic theorem fails at x exactly if r(x, /, a, (3) = 
oo for some a < (3. Our plan is to look at an Martin-L6f test (Vj) and, as 
sequences a are enumerated into an appropriate Vj, ensure that the lower 
bound on r(x, /, 1/3, 1/2) increases for each x £ [cr]. 

Theorem 4.2. Suppose x £ 2^ is not Martin-Ldf random. Then there is a 
computable set A and a computable transformation T : 2 U — >■ 2 U such that x 
is not typical with respect to the ergodic theorem. 

Proof. Let (Vj) be a Martin-L6f test witnessing that x is not Martin-L6f 
random, so x G C^jVj. We will construct an increasing sequence of useful 
partial transformations To C T\ C T2 C • • • so that T = U n T„ will be 
the desired transformation. We will maintain, at each stage, a computable 
partition of 2 W into clopen components W n , and for each value k, we will 
maintain components and B% with the requirement that it is always 
possible to extend T n such that it maps each component to itself. (Since the 
partition components are all clopen sets, we will also treat these sets as a 
partition of 2- m for m sufficiently large.) 

W n represents the "work area" ; initially W° will be a portion of 2 W known 
to contain x. In later stages, W n will grow to include parts of the A 1 ? and 
B% which have been used. A = UkA® will be the set which will demonstrate 
the failure of the ergodic theorem for x. Our strategy will then be that 
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when we discover elements in V n for appropriate n, we will arrange for the 
transformation to eventually map those elements through A for a long time, 
ensuring that the average membership in A reaches 1/2. We will then have 
the transformation map those elements through B = Ufc-Bj? for a long time 
to bring the average down to 1/3. We will do this to each element of DjVj 
infinitely many times, ensuring that elements in this intersection are not 
typicalo A k is the section of A reserved for making the average large for the 
(k + l) st time, and B k is the section reserved for making the average small 
again after. AV, and B2 represent the portions still available at stage n after 
some parts have been used. We also keep track of constants a k < fJ-(A k ) and 
frfc ^ A*(-^fe)) which represent how much of AV, and BV; have already been 
committed but not yet used. 

Finally we have a partition W n = U^WJ}, where elements of W k are 
those which are already guaranteed in stage n to have k upcrossings, and a 
function p n : T + U T_ — > N, where V pn ^ is the element of our test set we 
will be watching to discover which elements of [a] require a new upcrossing. 

Initially, we assume without loss of generality that x belongs to some 
clopen set W° with fi(W°) < 1. For instance, we can suppose we know the 
first bit of x and let W° equal [0] or [1] as appropriate. Then, from the 
remaining measure, we take A° k and B° k so that p{B k ) = 2^{A° k ) for all k. 
We set Wq = W° and take To to be the trivial transformation of height 
(i.e., To(cr) = () for all a). Choose j large enough that (i(Vj) < m(^q) an d 
set p°(()) = j. Set ag = fx(Vj), b° = 2/i(Vj), and for k > 0, a° k = b° k = 0. 

We require that all determined members of A k and B k are unblocked. 
Finally, we will maintain a stronger form of escapability: we require that if 
a is determined and |T„(cr)| < \a\, then a belongs to either W n or A k or B k 
for some k, and we require that a has an escape sequence contained entirely 
in the same component. 

Now we proceed by stages. At each even stage n, we take steps to ensure 
that T is defined almost everywhere. Given T n , defined by T+, T_, we define 
T' + = T + and define T'_ by 

TL = U aeT _{a-(0),a-(l)}. 

We then set T n+ i(cr"(0)) = T n (a). Let r be the initial element of the open 
loop containing a in T, so T(a) C r. In particular, there is a b € {0, 1} such 
that T(a)^(b) C r, and we set T n+ i(a^ (1)) = T(a)^(b). Note that this 
process ensures that as long as x € 2 W does not end in cofinitely many 0's, 
T(x) will be defined. 

We now consider the real work. At each odd stage n, we take steps to 
ensure that the ergodic theorem does not hold for any element of r)jVj. We 
may assume that exactly one r is enumerated into LijVj at this stage, and 
that if r is enumerated into Vj at this stage then for each i < j, there 

2 It is not possible to ensure that every element of the set Vj receives j upcrossings, 
since this would imply that the theorem holds for every x which failed to be even Demuth 
random, which would contradict V'yugin's theorem. 
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is a t' C t which was enumerated into Vi at some previous stage. We first 
assume r is determined. If p n (r) ^ j where r was enumerated into Vj, we do 
nothing, so we will assume that p n (r) = j. We have r € WTLi, and we have 
ensured inductively that bd Tn {r) < a% < p{AD and 2bdT n {r) < 6£ < fJ.(B%). 

In Figure HI we illustrate the way we intend to arrange T'. We must 
ensure that every point in [r], a section of fixed total measure, receives a 
new upcrossing. We must do so while ensuring that the total measure of the 
portions of A% used is strictly less than bd? n (r) + (fi(A^) — aJJ) and the total 
measure of the portions of B% used is strictly less than 2bdT n (r) + (n(B^) — 
bu). Finally, the escape sequences will all have a fixed height, which we 
cannot expect to bound in advance. Our solution will be to thin all the 
parts other than the escape sequences until the entire tower is so narrow 
that we can afford the error introduced by the various escape sequences. 

So let n, . . . ,r t be the open loop containing r and let et be the height 
of an escape sequence for Tt- For some finite U and each i < U, let 
Vi G A£ be distinct, incomparable, determined sequences so that p(A^) > 
Yli<u MM) = J2i<t f I ([ T i]) > (Such sequences exist because we 

have ensured that p(A^) > a£ > bdr n (r).) For each vi there is an escape 
sequence contained entirely in A^; let e u be the maximum of the heights of 
these sequences. Now choose a finite V and sequences i/j, i < V, so that 
p(B'k) > ^2i<v t^iWH) ^ ^bdT„(T). (Again, such sequences exist because 
fJ-(B^) > > 46dj' n (r).) For each V{ there is an escape sequence contained 
entirely in B^; let e v be the maximum of the heights of these sequences. 

For each Vi we fix an escape sequence entirely in A?, and for each Vi we 
fix an escape sequence entirely in B^. We now choose N and N' sufficiently 
large to carry out the following argument. We apply Lemma f3.3l to T\, . . . , Tt 
with e = 2~( N+N \ and to each vi and v% we first apply Lemma 13.41 with 
e = 2~ N , and then, letting v\ and v\ be the portions which are thinned, we 
then apply Lemma 13.31 to v[ and v[ with e = 2~ N ' . 

We now take escape sequences of width 2~( N+N '~ 1 ') for the v\ and v^. By 
choosing N large enough, we may ensure that we may choose these sequences 
to be nonoverlapping and that the leftover portions from the application of 
Lemma 13.41 are not completely filled by these portions. We take an escape 
sequence for Tt of the same width. We now apply Lemma 13.51 repeatedly. 

Let T' be the partial transformation we obtain after all these applications. 
Ui<t[ri] is contained in some open loop with final element r* and T'(r*) = (} 
(because r* was the last element of the escape sequence). Similarly [v^] 
and are contained in open loops with initial elements v®,vf and final 
elements v*, v*. Then we define T n+ i(r*) = v$, for i < U, T n+1 (v*) = v$ +1 , 

T n +i(vu) = ^oi and for * < V > T n+l( T i) = T i+1- 

Consider some element x of [r]. Let k = 2^bdT n {T) be the length of the 
open loop in T n containing r. In T n+ \ and any extension of T n+ x, we have 
ensured that x is first mapped through [r] for at most 2 N+N '~^k steps, x 
is then mapped through an escape sequence of length et- x is next mapped 
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through the [wj] and their escape sequences; the Vi account for 

J2 2^+N'-\vi\ _ 2 N'-\vi\ = ( 2 n+n> _ 2^') ^ > 2 N+N 'k2-W + e t 

i<U i<U 

steps since N, N' are sufficiently large. In particular, this means that by 
the time x reaches the end of the [vi], more than half these steps were in 
A% C A. Next, x is mapped through the [z/j] and their escape sequences. 
Since 

^2 2 N+N'-\u i \_ 2 N'-\ Ui \ = > 2 N+N '-2-Y,2- lVtl +Ue. 

i<V i<V i<U 

we have ensured that the number of steps in the [i/j] is twice as large as the 
entire segment before we entered the [vj\. In particular, this ensures that, 
after leaving the [i>i], at most one third of the steps were in A. 

To find A^ + we remove all the [vi] and their escape sequences, and to 
find B^ +1 we remove all the [r,] and their escape sequences. Naturally, we 
add these to W£t{, along with [r] and its escape sequence. Now consider 
any element of the long open loop containing [r], and let 7 be the burden 
of this element. We choose j large enough that fj,(A^ +1 ) — a^ +1 > ^2~ 3 
and (i(B% +1 — b^ +1 ) > 7 • 4 • 2 _J , set p n+1 to be j for every element of this 
open loop, and set ajjjj = a£ +1 + ~f2~i and 5™+} = + 472^. Finally 
a^ +1 = — 7 U where j u is the sum of the measures of the escape sequences 
for the V{, and b^ +1 = 6jJ — 7^ where 7^ is the sum of the measures of 
the escape sequences for the i/j. (We leave unchanged all other objects we 
maintain inductively — for instance, if k' 7^ k then A^ 1 = A^ and so on.) 

This completes the construction of T n+ \ and all associated objects in the 
case where we enumerate a single determined element into UjVj, as well as 
the proof that the construction is valid in that case. 

We must confront one final complication: it may be that r is not deter- 
mined. In this case we would like to split [r] into a union Uj-[t 3 ] where the t 3 
are determined. To do so, we apply the construction just described to each 
t j repeatedly. If multiple t 3 belong to the same open loop, we treat them 
simultaneously since the construction above applies to the entire open loop 
containing [r] . The only possible source of interference is that we may alter 
the escape sequences of one t 3 while applying the construction to another 
one. However we note that we do not need any sort of uniform bound on 
the length of escape sequences, and therefore this is not an obstacle. 

□ 



5. Upcrossings 

Throughout this section, we will take T to be a computable, measure- 
preserving transformation. 

Recall the following theorem of Bishop's 0]: 
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Escape sequences ( for 
the portions of B2- 



Escape sequences for 
the portions of A%. 



An escape sequence for r 



Portions of B^ with total area 
slightly larger than &dy(r). 



Portions of A£ with total ell cr 
slightly larger than 6dr(r). 



The open loop containing r 



Figure 4. Construction of T 



Theorem 5.1. 

J i~(x, f, a, j3)dx < J U ~ a) + dx. 

This is easily used to derive the following theorem of V'yugin: 

Theorem 5.2 ([iijl). If x is Martin-Lof random and f is computable then 
lim^oo ^ YTj=o f( Tjx ) converges. 

Proof. Suppose lim^oo ^j-j- Y^j=o fi^x) does not converge. Then there 
exist a < ft such that Sj=o f{T^x) is infinitely often less than a and 
also infinitely often greater than j3. Equivalently, t(x, f, a, (3) is infinite. But 
observe that when / is computable, r(x, f, a, 0) is lower semi-computable, 
so in particular, 

V n = {x\ r(x,f,a,P) > n} 
is computably enumerable and n(V n ) < n ^_ a ^ f(f~ a) + dx. Therefore an 
appropriate subsequence of (V n ) provides a Martin-Lof test, and x € Pi n V n , 
so x is not Martin-Lof random. □ 

We now consider the case where / is lower semi-computable. We will have 
a sequence of uniformly computable increasing approximations fi f, and 
we wish to bound the number of upcrossings in /. The difficulty is that 
t(x, fi, a, j3) is not monotonic in i: it might be that an upcrossing sequence 
for fi ceases to be an upcrossing sequence for /j+i. 
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In order to control this change, we need a suitable generalization of up- 
crossings, where we consider not only the up crossings for /, but for all 
functions between / and / + h where h is assumed to be small. 

Definition 5.3. A loose upcrossing sequence for a,f3,f,h is a sequence 

< U\ < V\ < U2 < v-i < ■ ■ ■ < UN < vn 

such that for all % < N, 



1 



y 



f(T j x) < a, 



1 



Vi 

y 



(f + h)(T*x)>l3. 



3=0 3=0 

v(x, f,h,a, (3) is the supremum of the lengths of loose upcrossing se- 
quences for a, (3, /, h. 

Loose upcrossings are much more general than we really need, and so the 
analog of Bishop's theorem is correspondingly weak. For instance, consider 
the case where T is the identity transformation, f = xa, and h = \B with 
A and B disjoint (so / + h = xaub)- Then v(x, f,h,a, (3) = oo whenever 
< a < (3 < 1. Nonetheless, we are able to show the following: 

Theorem 5.4. Suppose h > 0, J h dx < e and (3 — a > 5. There is a set A 
with fJ-(A) < Ae/8 such that 



JX\A 



v(x, /, h, a, (3)dx 



is finite. 

Proof. By the usual pointwise ergodic theorem, there is an n and a set A 1 
with n{A') <2e/5 such that if x £ A' then for all n', n" > n, 



3=0 3=0 

Consider those x A 1 such that, for some n' > n, 



< 5/2. 



1 n 
—— y h(T j x) > 5. 
n' + 1 4rf 



3=0 



We call this set A". Then for all n' > n, such an x satisfies 

1 



and in particular, 



nl + i y j h{Tix)>5/2, 

3=0 



hdx> 5n(A")/2. 



Therefore n{A") < 2e/S. If we set A = A' U A", we have fi(A) < 4e/5. 

Now suppose x A. We claim that any loose upcrossing sequence for 
a, f3, f, h with n < u\ is already an upcrossing sequence for a, f3 — 5. If n < 
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u\ < v\ < ••• <ujv<WArisa loose upcrossing sequence, we automatically 
satisfy the condition on the Uj. For any vi, we have 



j=0 j=0 3=0 

Since ^ £JL /i(T'z) < (5, it follows that ^ £Jl /C^'a;) > P ~ ° as 
desired. Therefore 

/ v(x, f,h,a, (3)dx < /j,(X \ A) / n + t(x, f,a, f3 — 8)dx 
Jx\A Jx\A 

is bounded. □ 

Theorem 5.5. // x is weakly 2-random and f is lower semi- computable 
then linin^oo YJj=o f(T jx ) converges. 

Proof. Suppose hin^^oo ^-j- Y0j=o f{^x) does not converge. Then there 
exist a < (3 such that ^-j- Y^=o i s infinitely often less than a and 
also infinitely often greater than j3. Equivalently, r(x, /, a, (3) is infinite. 
Let f n — > f be the sequence of computable functions approximating / from 
below. 

For each n, we set 

V n = {x\3m>n v(x, /„, f m - f n , a, (3) > n). 

By construction, x G n n V n . To see that V n +i C observe that if 

v(x,f n+1 ,f m - f n+1 ,a,/3) > n + 1 

then there is a loose upcrossing sequence witnessing this, and it is easy 
to check (since the f n are increasing) that this is also a loose upcrossing 
sequence witnessing 

V{x, f n , f m - fn,a,f3) > n + 1 > 71. 

We must show that /J,(V n ) — > 0. Fix <5 < (3 — a and let e > be given. 
Choose 7i to be sufficiently large that \\f n — f\\ < Se/4. Then, since the /„ 
approximate / from below, clearly v(x,f m ,f m > - f m ,a,/3) < v(x,f m ,f - 
f m , a, (3) for any ml > m. By the previous theorem, there is a set A with 
n(A) < e/2 such that j x ^ A v(x, f m , f — f m , a, (3)dx is bounded. We may 
choose n' > n sufficiently large that 

B = n({x A | v(x, f m , f - fm, a, 13) > n'}) < e/2. 

Then V n > C A U B, so n(V n i) < e. □ 
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5.1. Room for Improvement. It is tempting to try to improve Theorem 
15.41 The premises of that theorem are too general and the proof is oddly 
"half-constructive" — we mix the constructive and nonconstructive pointwise 
ergodic theorems. One would think that by tightening the assumptions and 
using Bishop's upcrossing version of the ergodic theorem in both places, we 
could prove something stronger. 

In the next theorem, we describe an improved upcrossing property which, 
if provable, would lead to a substantial improvement to Theorem 15 .5t bal- 
anced randomness would guarantee the existence of this limit. (Recall that 
a real is balanced random if it passes every balanced test, or sequence (Vi) of 
r.e. sets such that Vi = Wfrn for some 2 n -r.e. function / and < 2~ l 

for every i Q.) The property hypothesized seems implausibly strong, but 
we do not see an obvious route to ruling it out. 

Theorem 5.6. Suppose the following holds: 

Let f and e > be given, and let < ho < hi < ■ ■ ■ < h n be 
given with \\h n \\L°° < e. Then 

/ supr(x,/ + h n ,a,/3)dx < c([|/||l«>, e) 

JX n 

where c(||/||L°o,e) is a computable bound depending only on 
||/j| L oo and e. 

Then whenever x is balanced random and f is lower semi- computable then 
lim^oo ^ YJj=o fi T:, x) converges. 

Proof. We assume ||/||/,2 < 1 (if not, we obtain this by scaling). Suppose 
lim n _ >00 ^j-j- 5Zj=o / '(^ x ) does not converge. Then there exist a < (3 such 
that ^J-j- Y^j=o i s infinitely often less than a and also infinitely often 

greater than f3. Equivalently, t(x, f,a, ft) is infinite. Let f n — > f be the 
sequence of computable functions approximating / from below. 
We define the set 

V(n,k) = {x\3m>n t(x, f m , a, /3) > k}. 

We then define the function g(n, n') to be least such that Vm € [n, n'] \\f n > — 
fm\\< 2~ n and g(n) = lim ra / g(n, n'). Since the sequence f m converges to / 
from below, g(n) is defined everywhere, and \{s \ g(n, s + 1) ^ g(n, s)}\ < 2 n 
for all n. Indeed, g(n) is the least number such that Vm > g{n) \\f — f m \ \ < 
2~ n . 

Observe that /i(V( n fe )) < c( -^^ L ^°' 2 — -. Choose h(n) to be a computable 

function growing quickly enough that c ^^^y 2 — - < 2~ n for all n. If x £ 

V(g(n+l),h(n+ 1 )) ^ en there is some m > g{n + 1) so that r(x,/ m ,a,/3) > 
h(n + 1). Since g(n + 1) > g(n) and h(n + 1) > h(n), we also have that 
x € V( g {n),h{n))- Therefore {V^^Nn))) is a balanced test. 
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But since t(x, f, a, 0) is infinite, we must have x G ftV(g(n),h(n))- This con- 
tradicts the assumption that x is balanced random, so lim n _ J . 00 X^j=o fffix) 
converges. □ 

In fact, the test (V(g(n)Mn))) nas an additional property: if so < si < ^2 
with g(n + 1, s ) / g(n + 1, si) / g(n + 1, s 2 ) then s ) / g(n, s 2 ). This 
means that {V( g t n )Mn))) is actually an Oberwolfach test [2j, and so we can 
weaken the assumption to x being Oberwolfach random. 

6. Discussion for Ergodic Theorists 

In the context of analytic questions like the ergodic theorem, matters 
of computability are mostly questions of continuity and uniformity: the 
computability of a given property usually turns on whether it depends in an 
appropriately uniform way on the inputs. Algorithmic randomness gives a 
precise way of characterizing how sensitive the ergodic theorem is to small 
changes in the underlying function. 

The paradigm is to distinguish different sets of measure 0, viewed as 
an intersection A = H^A,, by characterizing how the sets Aj depend on 
the given data (in the case of the ergodic theorem, the function /). The 
two main types of algorithmic randomness that have been studied in this 
context thus far are Martin-L6f randomness and Schnorr randomness. In 
both cases, we ask that the sets Aj be unions Aj = UjAj j of sets where Aij 
is determined based on a finite amount of information about the orbit of / 
(in particular, the dependence of Ajj on / and T should be continuous). (To 
put it another way, we ask that the set of exceptional points which violate 
the conclusion of the ergodic theorem be contained in a G$ which depends 
on / in a uniform way.) The distinction between the two notions is that in 
Schnorr randomness, n(Aj) = 2~ l , while in Martin-L6f randomness, we only 
know fJ,(Ai) < 2~ l . This means that in the Schnorr random finite 
amount of information about the orbit of / suffices to limit the density of Aj 
outside of a small set (take J large enough that n(Uj<jAij) is within e of 
2~ 4 ; then no set disjoint from Uj<jAjj contains more than e of A4). In the 
Martin-L6f random case, this is not possible: if n(Ai) < 2~' — e, no finite 
amount of information about the orbit of / can rule out the possibility that 
some Aij with very large j will add new points of measure e. In particular, 
while we can identify sets which do belong to Aj, finite information about 
the orbit of / does not tell us much about which points are not in Aj. 

The two classes of functions discussed in this paper are the computable 
and the lower semi-computable ones; these are closely analogous to the con- 
tinuous and lower semi-continuous functions. Unsurprisingly, both the pas- 
sage from computable to lower semi-computable functions and the passage 
from ergodic to nonergodic transformations make it harder to finitely charac- 
terize points violating the conclusion of the ergodic theorem. Perhaps more 
surprising, both changes generate precisely the same result: if a point vio- 
lates the conclusion of the ergodic theorem for a computable function with 
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a nonergodic transformation, we can construct a lower semi-computable 
function with an ergodic transformation for which the point violates the 
conclusion of the ergodic theorem, and vice versa. 

The main question we leave open is what happens when we make both 
changes: what characterizes the points which violate the conclusion of the 
ergodic theorem for lower-semi computable functions with nonergodic trans- 
formations? The answer is likely to turn on purely ergodic theoretic ques- 
tions about the sensitivity of upcrossings, such as the hypothesis we use 
above. 

Question 6.1. Let (X,fi) be a metric space and letT : X — > X be measure 
preserving. Let e > be given. Is there a bound K ( depending on T and on 
e) such that for any f with ||/||l°o < 1 and any sequence < ho < h± < 
■ ■ ■ < h n with \ \h n \\L°° < e, 



where t(x, g,a, (3) is the number of upcrossings from below a to above j3 
starting with the point x ? 
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